Rank | Count | Beginning |
---|---|---|
2909 | 908 | Die |
2385 | 459 | Der |
1743 | 411 | Das |
4609 | 221 | Es |
5939 | 211 | In |
5530 | 173 | Ich |
8633 | 167 | Und |
7970 | 141 | Sie |
246 | 140 | Alle |
5781 | 132 | Im |
737 | 126 | Auch |
11 | 118 | Aber |
6884 | 115 | Mit |
4452 | 110 | Er |
1039 | 100 | Bei |
4193 | 95 | Ein |
9328 | 89 | Wenn |
5022 | 81 | Für |
8150 | 81 | So |
7052 | 78 | Nach |
517 | 75 | Am |
4223 | 70 | Eine |
9510 | 69 | Wie |
4018 | 68 | Doch |
9604 | 62 | Wir |
1507 | 60 | Da |
9173 | 60 | Was |
427 | 59 | Als |
3627 | 59 | Diese |
1600 | 58 | Damit |
In the next four subsections show the most frequent sentence beginnings consisting of N words, N=1, 2, 3, 4. In this subsection we start with N=1.
The most frequent word-N-grams at the beginning of sentences give some insight into sentence composition.
Especially for N=1, we only need a small corpus to identify the most frequent sentence beginnings.
select substring_index(sentence, ' ', 1) as beg, count(*) as cnt from sentences group by substring_index(sentence, ' ', 1) order by cnt desc limit 50;
4.3.1.2 Most Frequent Sentence Beginnings II
4.3.1.3 Most Frequent Sentence Beginnings III
4.3.1.4 Most Frequent Sentence Beginnings IV
4.3.1.1 Most Frequent Sentence Endings I
4.3.1.2 Most Frequent Sentence Endings II
4.3.1.3 Most Frequent Sentence Endings III
4.3.1.4 Most Frequent Sentence Endings IV